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Abstract 

Confounding of three binary-variables counterfactual model is discussed in this 
paper. According to the effect between the control variable and the covariate vari- 
able, we investigate three counterfactual models: the control variable is independent 
of the covariate variable, the control variable has the effect on the covariate variable 
and the covariate variable affects the control variable. Using the ancillary infor- 
mation based on conditional independence hypotheses, the sufficient conditions to 
determine whether the covariate variable is an irrelevant factor or a confounder in 
each counterfactual model are obtained. 

Keywords: Causal effect; Independence hypothesis; Counterfactual model; 
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1 Introduction 

Causal inference has become an important research field in statistics, data mining, epi- 
demiology and machine learning in recent decades (Kleinbaum et al.., 1982; Rothman, 
1986; Greenland, Robins and Pearl, 1999; Zheng et al, 2001; Geng et al, 2004; Xie 
and Geng, 2008). Confounding and confounder are two basic concepts for causal infer- 
ence (Kleinbaum et al.., 1982; Greenland and Robins, 1986). Several models have been 
presented for causal inference, two of which are the causal diagram model and counter- 
factual model(Rubin, 1974; Holland, 1986; Geng et al, 2004). 

In the presence of confounding bias, the effect of exposure on the rate of a disease 
can not be assessed correctly. By Greenland and Robins (1986), there are basically 
two main approaches for assessing confounding and confounder. One approach, which 
is called 'collapsibility-based', regards confounding bias as arising from difference be- 
tween stratified measures of association and the corresponding original measure. The 
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other, which is called 'comparability-based', regards confounding bias as arising from the 
exposed and unexposed populations which are not comparable. The comparability-based 
approach determines a factor to be a confounder if adjusting for it can reduce confounding 
bias (Greenland and Robins, 1986; Greenland, Robins and Pearl, 1999). Geng et al. (2002) 
mainly discussed the confounding of multi-value variables and give the criteria for con- 
founding. Zheng et al. (2002), Liang and Zheng (2003) discussed the identifiability of the 
causal effect of two kinds of counterfactual models using the independence hypotheses 
respectively. 

Traditionally, a confounding variable (the precise definition of a confounder) is a vari- 
able which is a common cause of both the control variable and the response variable 
(Wunsch, 2007). Whether the covariate variable which is not a common cause of both the 
control variable and the response variable in three binary-variables counterfactual models 
must not be a confounder? Kleinbaum et al. (1982), Greenland and Robins(1986) gave a 
qualitative definition of confounder: controlling a variable can reduce confounding, then 
the variable is called a confounder. So the covariate variable, which is not a common 
cause of both the control variable and the response variable but must affect the response 
variable, may be a confounder. In this paper, according to the precise definition, one 
model as shown in Figl is discussed: the covariate variable affects the control variable 
and the response variable at the same time, and the control variable affects the response 
variable. By the qualitative definition, we investigate other two models: one, as shown in 
Fig2, is that the control variable has the effect on the covariate variable and the covariate 
variable affects the response variable; the other, as shown in Fig3, is that the control 
variable is independent of the covariate variable and the covariate variable affects the 
response variable. Obviously, the third model is the special case of the other two models 
with independence of the control variable and the covariate variable. Then we use the 
formal definitions of a confounder and an irrelevant factor in Geng et al. (2002) and the 
ancillary information based on conditional independence hypotheses (Zheng et al, 2002; 
Liang and Zheng, 2003) to discuss the confounding of above-mentioned counterfactual 
models. 

The rest of the paper is organized as follows: In section 2, we introduce the main 
notation and definitions. In section 3, confounding of three kinds of three-binary- variables 
counterfactual models are discussed respectively. Section 4 concludes the paper. 

2 Notation and definitions 

Let E, D, C be binary variables. Let the control variable E be an exposure with the 
values e and e representing "exposed" and "unexposed" respectively. Let the response 
variable D be an outcome with the values and 1 denoting the presence or absence of a 
disease, where D e is the corresponding response when E = e and D s is the corresponding 
response when E — e , both of which take values 1 or denoting the presence or absence 
of a disease. Let C be a covariate variable with possible values or 1. 

Many kinds of studies focus on the effects of exposure on the rate of a disease in the 
exposed population. Let P(D S = 1\E = e) and P(D e — 1\E — e) be the proportions 
of diseased individuals in the unexposed population and the exposed population. Let 
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P(D E — 1\E — e) be the hypothetical proportion of individuals in the exposed population 
who would have attacked by the disease even if they had not been exposed. Since P(D S = 
1\E — e) is a hypothetical proportion, the model (Rubin, 1974; Holland, 1986) is a 
counterfactual model. 

In order to identify the casual effect of exposure on response, confounding bias B is 
defined as the difference between the hypothetical proportion of diseased individuals in 
the exposed population (Wickramaratneand and Holford, 1987; Holland, 1989), that is 

(2.1) B = P(D S = 1\E = e) - P(D S = 1\E = e). 

If B = 0, then there is no confounding. 

By the common standardization in epidemiology (Miettinen, 1972; Kleinbaum, 1982; 
Rothman, 1986; Geng et al, 2002), the standardized proportion P A (D S = 1\E = e) which 
is obtained by adjusting the distribution of C in the unexposed population to that in the 
exposed population is 

P A (D- e = 1\E = e) 
( 2 - 2 ) = £ P{D~ e = l\E = e,C = k)P{C = k\E = e). 

fc=0 

Definition l^. A covariate C is a confounder if 

(2.3) \P(D e = 1\E = e) - P A (D S = 1\E = e)\ < \B\. 

From the definition, we find that the standardized proportion P/\{D E = 1\E — e) obtained 
by adjusting for the irrelevant factor is closer to the hypothetical proportion P(D S = 
1\E = e) than the observed proportion P(D S = 1\E = e). 
Definition 2^. A covariate C is an irrelevant factor if 

(2.4) P A (D e - = l\E = e) = P{D- e = 1\E = e). 

Since the estimation of the hypothetical proportion is still unchanged after being ad- 
justed for an irrelevant factor, we do not need to adjust for it to reduce confounding bias. 

Lemma 1. If a covariate C is an irrelevant factor, it must not be a confounder. Ob- 
viously, the inverse negative proposition is also true. 



Proof. From the condition that C is an irrelevant factor, we can obtain that 

P A (D S = 1\E = e) = P(D S = 1\E = e). 

Then, 

\P(D e = 1\E = e) - P A (D e = 1\E = e)\ 

= \P(D e = 1\E = e) - P{D E = 1\E = e)\ = \B\. 

So, C is not a confounder. 

From the condition that C is a confounder, we can obtain 

\P(D e = 1\E = e) - P A (D S = l\E = e)\ 

< \B\ = \P{D- e = 1\E = e) - P(Dg = 1\E = e)\. 



3 



Then, 

P A (D S = l\E = e) ± P(D S = 1\E = e). 

So, C is not an irrelevant factor. 

However, the converse proposition of lemma 1 is not always true. For example' 7 ', let a 
factor C express groups categorized by every 10 years of age, and its values 1, 2, 3 and 4 
denote the original age groups 20-29, 30-39, 40-49 and 50-59 years respectively. Suppose 
that there is no exposure effect, i.e. there are only individuals of type 1 (individual 
'doomed') and type 4 (individual immune to disease), and that the joint distribution of 
disease, exposure and a factor C is given in Table 1. Here' 7 ] 

P(D S =l\E = e) = 0.52, P(D g = 1\E = e) = 0.58, 
\B\ = 0.06. 

When the individuals are regrouped by younger than 50, we can obtain a coarse sub- 
population given in Tablel. 



Tablel 



Type 


Distribution for the values of C 


Ce {1,2,3} 


Ce{4} 




E = e 


E = e 


E = e 


E = e 


1 ('doomed') 


133 


122 


23 


52 


4('immune') 


117 


78 


27 


48 


Total 


250 


200 


50 


100 



Then, 



So, 



P A (D S = l\E = e) ~- 

122 250 52 
x — + 



200 300 100 300 



P(D- e = l\E = e) 
50 

x — = 0.595 ^ 0.58. 



\P(D e = l\E = e) 
= 10.595 — 0.521 = 



-P A (D g =l\E 
0.075 > \B\. 



To sum up, C is not a confounder, but it is not an irrelevant factor. 

What's more, the negative proposition of lemma 1 is also not always true. For 
example' 7 ], when the individuals are regrouped by younger than 40 but older than 30, 
we can obtain a coarse subpopulation given in Table2. 



Table2 



Type 


Distribution for the values of C 


Ce {1,3,4} 


Ce{2} 




E = e 


E = e 


E = e 


E = e 


1 ('doomed') 


46 


26 


110 


148 


4('immune') 


54 


24 


90 


102 


Total 


100 


50 


200 


250 
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Then, 

P A (D S = 1\E = e) = P(D S = l\E = e) 

26 100 148 200 „ r „ o . n ro 

= — x + x = 0.568 ^ 0.58. 

50 300 250 300 T 

So, 

\P(D S = l\E = e)- P A (D S =l\E = e)\ 
= |0.568 -0.52| = 0.048 < \B\. 

To sum up, C is not an irrelevant factor, but it is not a confounder. 



3 Confounding of counterfactual model 

Considering the confounding of three binary-variables counterfactual model, there are 
three counterfactual models as follows: 



E{e,e] 

Q- 



C{0, 1} 

-o 



E{e,e] 

Q- 



C{0,1} 

■D 



E{e,e] 

Q 



C{0,1} 

D 



O 

£>{0,1} 
Figl.The first model 



O 

D{0,1} 



o 

D{0,1} 



Fig2.The second model Fig3.The third model 



To discuss whether there be confounding in our considering models, we use the con- 
ditional independence hypotheses as follows as the ancillary information(CK): 

(1) E±D e . 

(2) E±D B \C = 0. 

(3) E±Dg\C= 1. 

(4) E±C. 

{5)De±C. 

(6)D S ±C\E = e. 
{7)D- e LC\E = e. 



3.1 The first model 

As shown in Figl, C has effect on E and D at the same time, and C affects E. In order 
to calculate simply, suppose: 

P(C=l)=t, 

P(E = e\C = 1) = ai, P(E = e\C = 0) = a , 
P(Dg =l\E = e,C = j) = bj, 
P(D e = 1\E = e,C = 0) = 
P(D g = 1|E = e,C = 1) = Ml, 
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where 

i = 1 — t, a = 1 — a , di = 1 — ai,bj = 1 — bj. 

t, do, a±,bj can be observed from original data, but Uq,Ui can not be observed because 
they are hypothetical proportions. 

Then, 

P A (D S = l\E = e) 

= E P{D~e = l\E = e,C = k)P{C = k\E = e) 

k=0 

boa t + bia,\t 

dot + CL\t 

P(D s = l\E = e) 

= E p ( D e = l\E = e,C = k)P{C = k\E = e) 

k=0 

UoCL()t + Uidit 
dot + dit 

P(D s = l\E = e) 

= E p ( D e = l\E = e,C = k)P(C = k\E = e) 

k=0_ _ 

b d t + bidit 
dot + dit 

B = P(D B = 1\E = e) - P(D S = 1\E = e) 
b d t + b±dit b a t + bidit 

dot + ait dot + d\t 

Now translate each condition of (!K) into parameter form: 

, lX . 6 a t + fciait Mb^+Mi* 

{1).E _L L> g «- e - = — — — z 

dot + ait dot + dit 

(2) .E± L>e|C = i.e. m = ^o 

(3) .£ JL L>e|C = 1 i.e. wi = 6i 

(4) .C 1 De i-e. tioa + b d = Uidi + &iai 

(5) .C± £> g |£ = e i.e. 6 = &i 

(6) .C _L D S \E — e i.e. uo = ui 

(7) .C _L i.e. a = ax 



So, 



Theorem 1. If 

(a)£±C or 
{b)D g ±C\E = e or 
{c)D e ±C\E = e,E±D s \C 

The covariate C is an irrelevant factor. 
Proof. 
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In order to prove C is an irrelevant factor, we only need to prove 

bodoi + bidit bodoi + bidit 
a^i + a\t dot + dit 

That is, 

(b - h)(a - oi) = 0. 

(a) . From the condition E±C, we can obtain 

o = Oi. 

So, 

P A (£>g = 1|£ = e) = P(D- e = 1\E = e). 

(b) . From the condition D g ±C|i? = e, we can obtain 

b = h. 

So, 

P A (D S = 1\E = e) = P(D e = 1\E = e). 

(c) . From the condition P_LD g |C, we can obtain 

E ± D S \C = i.e. uo = b , 
E JL L>g|C = 1 i.e. m = b ± . 

Furthermore, 

C _L -Dg|P = e i.e. it — u i- 

Thus, 

6 = 6i. 

So, 

P A (D g = 1\E = e) = P(D- e = 1\E = e). 

Theorem 2. If 

(a) £±£> g or 

(b) C±D e \E = e,E±D s \C or 

(c) E±De\C = 0, C±D e \E or 

(d) ELD s \C = 1, C±De\E or 

(e) E±D s \C,E±C 

There is no confounding. 
Proof. 

(a). From the condition E±D S , we can obtain 

bodoi + bidit _ b d i + bidit 
d i + dit dot + dit 
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Then, 

b a t + biait b d t + bidit 

D = = = = U. 

a t + ait dot + ait 

(b) .From the condition that E±D g \C, we can obtain 

E JL D S \C = i.e. w = &o, 
£ JL D e -|C= 1 i.e. «i = 61. 

Furthermore, 

C JL L>g|£ = e i.e. 6 = 61. 

Then, 

& a t + Mi* Mo* + Mi* 

Jd = = = — = U. 

ciot + ait dot + ci\t 

(c) .From the conditions that CLD S \E, we can obtain 

C _L D g \E = e i.e. b = h, 
C _L D g |-E = e i.e. ito = u\. 

Furthermore, 

E JL D g |C = «.e. m = 6 - 

So, 

n Mo* + Mi* Mo* + Mi* , 

£> = = = — — = = u — b = (J. 

ciot + ait aot + ait 

(d) .From the conditions that C-LD S \E, we can obtain 

C JL D g \E = e i.e. b = h, 
C J_ -Dgl-E = e i.e. it = u±. 

Furthermore, 

E JL Dg|C = 1 i.e. mi = 61. 

So, 

Mo* + Mi* b a * + Mi* 

B = = = — — = = ui — bi — 0. 

d t + ait ao* + ait 

(e) .From the condition that E.LD S \C, we can obtain 

E JL Dg|C = i.e. w = 60, 
£ JL £> g |C= 1 i.e. u x = h. 

Furthermore, 

C _L E i.e. do = d\. 

So, 

Mo* + Mi* Mo* + 61 ait 

£5 = = = = U. 

agt + ait aot + ait 
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3.2 The second model 

As shown in Fig2, E and C has effect on D at the same time, and E affects C. In order 
to calculate simply, suppose: 

P(E = e) = a, 

P(C = 1\E = e) = Ci,P(C= l\E = e) = c , 
P(£>e = l\E = e,C = j) = bj, 
P(D g = l\E = e,C = 0) = u , 
P(D g = l\E = e,C= 1) =ui, 

where 

a = 1 — a, do = 1 — Co, Ci = I — ci, 6-,- = I — 6j. 

a, ft, can be observed from original data, but c , Ci, w , «i can not be observed because 
they are hypothetical proportions. 



Then, 



P A (D g = 1\E = e) 



So, 



= E p ( D e = l\E = e,C= k)P{C = k\E = e) 
= b Ci + b 1 c 1 . 
P(D s = l\E = e) 

= E P{D~e = l\E = e,C = k)P{C = k\E = e) 

= UqCi + U\C\. 

P(D s = l\E = e) 

= E P{D~e = l\E = e,C = k)P{C = k\E = e) 

k=0 

= 6 co + bic Q . 



B = P(D S = 1\E = e) - P(D S = 1\E 
= UqCx +u 1 c 1 - b c -hc . 



Now translate each condition of {"K) into parameter form: 

(1) .E _L D E i.e. u$ci + U\C\ = b c + bic 

(2) .E JL D S \C = i.e. w = b 

(3) .E JL D e -|C = 1 i.e. «i = 6i 

. M coa + ^oCia Uicia + &ic a 

(4) .C7±Dg «.e. — = — —— = — 

Cia + c o c a + c x a 

(5) .C JL D g \E = e i.e. b = h 

(6) .C J_ D S \E = e i.e. uq — u\ 

(7) .C±E i.e. c = c x 

Theorem 3. If 
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{a)E±C or 
(b)D s ±C\E = e or 
{c)D e ±C\E = e,E±D s \C 

The covariate C is an irrelevant factor. 
Proof. 

(a) . From the condition E±C, we can obtain 

Cq—C\. 

So, 

Pa(D s = 1\E = e) = 6 ci + 61 ci = b c + bic 
= P(D e = l\E = e). 

(b) . From the condition D g -LC\E = e, we can obtain 

b = h. 

So, 

P A (D S = 1\E = e) = fo ci + 61C1 = 6o(ci + ci) 
= 6 (c + c ) = 6 Co + h c = P(De = 1|£ = e). 

(c) . From the condition D S -LC\E = e, we can obtain 

E±D g \C = i.e. 60 = 61, 



Furthermore, 
So, 

Theorem 4. If 



De±C\E = e i.e. b = b t . 

P A (D S = 1\E = e) = P(D S = 1\E = e) 
{a)E±D s or 

(b) E±D s \C = 0, C±D S \E or 

(c) E±Dg\C=l,C±D g \E or 

(d) E_LD E \C,E_LC 

There is no confounding. 
Proof. 

(a). From the condition that E±D S , we can obtain 

UqCi + U1C1 = b c~o + 61 Co- 

Then, 

.B = u i + uit — b i — bit = 0. 
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(b). From the conditions that C±D S \E, we can obtain 

C _L D g \E = e i.e. bo = bi, 
C _L D g \E = e i.e. uq = u\. 



E JL D e -|C = i.e. u = b . 



Furthermore, 
So, 

B = u C! + u-iCi - b c - bic = u - b = 0. 
(c). From the conditions that CLD S \E, we can obtain 

C _L -Dg|-E = e i.e. bo = fri, 
C _L Del-E = e i.e. u = u±. 



E JL DglC = 1 i.e. ui = b ± . 



Furthermore, 
So, 

= u c~i + U\C\ — b c — biCQ — Ui — b\ — 0. 
(d). From the condition that E.LD S \C, we can obtain 

E _L Dg|C = i.e. -u = feo, 
£ JL D g C = 1 i.e. u t = 6i. 



Furthermore, 
So, 



C -L E i.e. cq — ci. 

B = uq! + i^t — 6 i — 

= (u - 6 )co + (uit - &i)cq = 0. 



3.3 The third model 

As shown in Fig3, both of and C have effects on D, and E±C. In order to calculate 
simply, suppose: 



P(£ = 


= e) = a,P(C = 


1) 


= t, 


P(D S 


= l\E = e,C = 


3) 


= bj, 


P(D S 


= l\E = e,C = 


0) 


= u . 


P{D- e 


= l\E = e,C = 


1) 


= U\. 



where 

a — 1 — a, i — 1 — t, bj — 1 — bj. 

a,t,bj can be observed from original data, but u ,ui can not be observed because they 
are hypothetical proportions. 
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Then, 

P A (D S = l\E = e) 

= E p ( D e = l\E = e,C = k)P(C = k\E = e) 

k=0 

= bot + ht. 
P(D e = l\E = e) 

= E P{ D e = l\E = e,C = k)P{C = k\E = e) 

= U t + Uit. 

P(D- e = l\E = e) 

= p ( D e = l\E = e,C = k)P(C = k\E = e) 

k=0 

= bot + ht. 

So, 

B = P(D S = 1\E = e) - P(D S = 1\E = e) 
= Uoi + uit — b i — bit. 

Of course, 

P A (D S = 1\E = e) = P(D S = 1\E = e), 

so the covariate C is an irrelevant factor, but not a confounder and cannot reduce con- 
founding. 

Now translate each condition of {"K) into parameter form: 

(1) .E _L D s i.e. uoi + uit = b t + bit 

(2) .E JL De\C = i.e. u = b 

(3) .E JL D S \C = 1 i.e. u ± = h 

(4) .ClD g i.e. b a + u a = bi<i + uia 

(5) .C JL D g \E = e i.e. b = h 

(6) .C J_ D S \E = e i.e. uo — u\ 

Theorem 5. If 

{a)E±D s or 

(b) E±D s \C = 0, E±De\C = 1 or 

(c) D g ±C\E, E±D S \C = or 

(d) D s ±C\E,E±D s \C = 1 

There is no confounding. 
Proof. 

(a). From the condition that E.LD S , we can obtain 

uoi + uit = boi + bit. 

Then, 

B = uoi + u\t — boi — bit = 0. 
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(b). From the conditions that 



E±D S \C = and ELD S \C = 1, 

we can obtain 

Uq = b and U\ —b\. 

So, 

B = u i + uit — b i — bit = 0. 

(c) . From the condition that D S -LC\E, we can obtain 

D S -LC\E — e i.e. bo = b±, 
C _L D S \E = e i.e. uo = u\. 

Furthermore, 

E JL D g \C = i.e. u = b . 

So, 

B = u i + U\t — b i — bit = u — b = 0. 

(d) . The proof is similar to (c). 

4 Conclusion 

Using the formal definitions of a confounder and an irrelevant factor, we discuss the 
confounding of three kinds of three binary- variables counterfactual models in epidemiology 
studies and statistics. The sufficient conditions of non-confounding and whether the 
covariate is an irrelevant factor or a confounder are discussed. The future work will 
extend the three variables counterfactual model to multi-variable counterfactual model. 
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